NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

hu.MAP3.0: Atlas of human protein complexes by integration of > 25,000 proteomic experiments

https://doi.org/10.1101/2024.10.11.617930

Fischer, Samantha N; Claussen, Erin R; Kourtis, Savvas; Sdelci, Sara; Orchard, Sandra; Hermjakob, Henning; Kustatscher, Georg; Drew, Kevin (October 2024, bioRxiv)

Abstract Macromolecular protein complexes carry out most functions in the cell including essential functions required for cell survival. Unfortunately, we lack the subunit composition for all human protein complexes. To address this gap we integrated >25,000 mass spectrometry experiments using a machine learning approach to identify > 15,000 human protein complexes. We show our map of protein complexes is highly accurate and more comprehensive than previous maps, placing ∼75% of human proteins into their physical contexts. We globally characterize our complexes using protein co-variation data (ProteomeHD.2) and identify co-varying complexes suggesting common functional associations. Our map also generates testable functional hypotheses for 472 uncharacterized proteins which we support using AlphaFold modeling. Additionally, we use AlphaFold modeling to identify 511 mutually exclusive protein pairs in hu.MAP3.0 complexes suggesting complexes serve different functional roles depending on their subunit composition. We identify expression as the primary way cells and organisms relieve the conflict of mutually exclusive subunits. Finally, we import our complexes to EMBL-EBI’s Complex Portal (https://www.ebi.ac.uk/complexportal/home) as well as provide complexes through our hu.MAP3.0 web interface (https://humap3.proteincomplexes.org/). We expect our resource to be highly impactful to the broader research community.
more » « less
Full Text Available
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundation Models

https://doi.org/10.3390/bioengineering11100984

Sankar, Baradwaj Simha; Gilliland, Destiny; Rincon, Jack; Hermjakob, Henning; Yan, Yu; Adam, Irsyad; Lemaster, Gwyneth; Wang, Dean; Watson, Karol; Bui, Alex; et al (October 2024, Bioengineering)

Foundation Models (FMs) are gaining increasing attention in the biomedical artificial intelligence (AI) ecosystem due to their ability to represent and contextualize multimodal biomedical data. These capabilities make FMs a valuable tool for a variety of tasks, including biomedical reasoning, hypothesis generation, and interpreting complex imaging data. In this review paper, we address the unique challenges associated with establishing an ethical and trustworthy biomedical AI ecosystem, with a particular focus on the development of FMs and their downstream applications. We explore strategies that can be implemented throughout the biomedical AI pipeline to effectively tackle these challenges, ensuring that these FMs are translated responsibly into clinical and translational settings. Additionally, we emphasize the importance of key stewardship and co-design principles that not only ensure robust regulation but also guarantee that the interests of all stakeholders—especially those involved in or affected by these clinical and translational applications—are adequately represented. We aim to empower the biomedical AI community to harness these models responsibly and effectively. As we navigate this exciting frontier, our collective commitment to ethical stewardship, co-design, and responsible translation will be instrumental in ensuring that the evolution of FMs truly enhances patient care and medical decision-making, ultimately leading to a more equitable and trustworthy biomedical AI ecosystem.
more » « less
Full Text Available
Complex portal 2025: predicted human complexes and enhanced visualisation tools for the comparison of orthologous and paralogous complexes

https://doi.org/10.1093/nar/gkae1085

Balu, Sucharitha; Huget, Susie; Medina Reyes, Juan_Jose; Ragueneau, Eliot; Panneerselvam, Kalpana; Fischer, Samantha_N; Claussen, Erin_R; Kourtis, Savvas; Combe, Colin W.; Meldal, Birgit_H_M; et al (November 2024, Nucleic Acids Research)

Abstract The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated reference database for molecular complexes. It is a unifying web resource linking aggregated data on composition, topology and the function of macromolecular complexes from 28 species. In addition to significantly extending the number of manually curated complexes, we have massively extended the coverage of the human complexome through the incorporation of high confidence assemblies predicted by machine-learning algorithms trained on large-scale experimental data. The current content of the portal comprising 2150 human complexes has been augmented by 14 964 machine-learning (ML) predicted complexes from hu.MAP3.0. We have refactored the website to enable easy search and filtering of these different classes of protein complexes and have implemented the Complex Navigator, a visualisation tool to facilitate comparison of related complexes in the context of orthology or paralogy. We have embedded the Rhea reaction visualisation tool into the website to enable users to view the catalytic activity of enzyme complexes.
more » « less
Addressing barriers in comprehensiveness, accessibility, reusability, interoperability and reproducibility of computational models in systems biology

https://doi.org/10.1093/bib/bbac212

Niarakis, Anna; Waltemath, Dagmar; Glazier, James; Schreiber, Falk; Keating, Sarah M; Nickerson, David; Chaouiya, Claudine; Siegel, Anne; Noël, Vincent; Hermjakob, Henning; et al (July 2022, Briefings in Bioinformatics)

Computational models are often employed in systems biology to study the dynamic behaviours of complex systems. With the rise in the number of computational models, finding ways to improve the reusability of these models and their ability to reproduce virtual experiments becomes critical. Correct and effective model annotation in community-supported and standardised formats is necessary for this improvement. Here, we present recent efforts toward a common framework for annotated, accessible, reproducible and interoperable computational models in biology, and discuss key challenges of the field.
more » « less
Full Text Available
The ProteomeXchange consortium in 2020: enabling ‘big data’ approaches in proteomics

https://doi.org/10.1093/nar/gkz984

Deutsch, Eric W; Bandeira, Nuno; Sharma, Vagisha; Perez-Riverol, Yasset; Carver, Jeremy J; Kundu, Deepti J; García-Seisdedos, David; Jarnuczak, Andrew F; Hewapathirana, Suresh; Pullman, Benjamin S; et al (November 2019, Nucleic Acids Research)

Abstract The ProteomeXchange (PX) consortium of proteomics resources (http://www.proteomexchange.org) has standardized data submission and dissemination of mass spectrometry proteomics data worldwide since 2012. In this paper, we describe the main developments since the previous update manuscript was published in Nucleic Acids Research in 2017. Since then, in addition to the four PX existing members at the time (PRIDE, PeptideAtlas including the PASSEL resource, MassIVE and jPOST), two new resources have joined PX: iProX (China) and Panorama Public (USA). We first describe the updated submission guidelines, now expanded to include six members. Next, with current data submission statistics, we demonstrate that the proteomics field is now actively embracing public open data policies. At the end of June 2019, more than 14 100 datasets had been submitted to PX resources since 2012, and from those, more than 9 500 in just the last three years. In parallel, an unprecedented increase of data re-use activities in the field, including ‘big data’ approaches, is enabling novel research and new data resources. At last, we also outline some of our future plans for the coming years.
more » « less
Full Text Available
BioSimulators: a central registry of simulation engines and services for recommending specific tools

https://doi.org/10.1093/nar/gkac331

Shaikh, Bilal; Smith, Lucian P; Vasilescu, Dan; Marupilla, Gnaneswara; Wilson, Michael; Agmon, Eran; Agnew, Henry; Andrews, Steven S; Anwar, Azraf; Beber, Moritz E; et al (May 2022, Nucleic Acids Research)

Abstract Computational models have great potential to accelerate bioscience, bioengineering, and medicine. However, it remains challenging to reproduce and reuse simulations, in part, because the numerous formats and methods for simulating various subsystems and scales remain siloed by different software tools. For example, each tool must be executed through a distinct interface. To help investigators find and use simulation tools, we developed BioSimulators (https://biosimulators.org), a central registry of the capabilities of simulation tools and consistent Python, command-line and containerized interfaces to each version of each tool. The foundation of BioSimulators is standards, such as CellML, SBML, SED-ML and the COMBINE archive format, and validation tools for simulation projects and simulation tools that ensure these standards are used consistently. To help modelers find tools for particular projects, we have also used the registry to develop recommendation services. We anticipate that BioSimulators will help modelers exchange, reproduce, and combine simulations.
more » « less
Full Text Available
SBML Level 3: an extensible format for the exchange and reuse of biological models

https://doi.org/10.15252/msb.20199110

Keating, Sarah M; Waltemath, Dagmar; König, Matthias; Zhang, Fengkai; Dräger, Andreas; Chaouiya, Claudine; Bergmann, Frank T; Finney, Andrew; Gillespie, Colin S; Helikar, Tomáš; et al (August 2020, Molecular Systems Biology)
null (Ed.)
Full Text Available

Search for: All records